Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yuki Ono

LF-Net: Learning Local Features from Images

May 24, 2018

Yuki Ono, Eduard Trulls, Pascal Fua, Kwang Moo Yi

Figure 1 for LF-Net: Learning Local Features from Images

Figure 2 for LF-Net: Learning Local Features from Images

Figure 3 for LF-Net: Learning Local Features from Images

Figure 4 for LF-Net: Learning Local Features from Images

Abstract:We present a novel deep architecture and a training strategy to learn a local feature pipeline from scratch, using collections of images without the need for human supervision. To do so we exploit depth and relative camera pose cues to create a virtual target that the network should achieve on one image, provided the outputs of the network for the other image. While this process is inherently non-differentiable, we show that we can optimize the network in a two-branch setup by confining it to one branch, while preserving differentiability in the other. We train our method on both indoor and outdoor datasets, with depth data from 3D sensors for the former, and depth estimates from an off-the-shelf Structure-from-Motion solution for the latter. Our models outperform the state of the art on sparse feature matching on both datasets, while running at 60+ fps for QVGA images.

Via

Access Paper or Ask Questions

Learning to Find Good Correspondences

May 21, 2018

Kwang Moo Yi, Eduard Trulls, Yuki Ono, Vincent Lepetit, Mathieu Salzmann, Pascal Fua

Figure 1 for Learning to Find Good Correspondences

Figure 2 for Learning to Find Good Correspondences

Figure 3 for Learning to Find Good Correspondences

Figure 4 for Learning to Find Good Correspondences

Abstract:We develop a deep architecture to learn to find good correspondences for wide-baseline stereo. Given a set of putative sparse matches and the camera intrinsics, we train our network in an end-to-end fashion to label the correspondences as inliers or outliers, while simultaneously using them to recover the relative pose, as encoded by the essential matrix. Our architecture is based on a multi-layer perceptron operating on pixel coordinates rather than directly on the image, and is thus simple and small. We introduce a novel normalization technique, called Context Normalization, which allows us to process each data point separately while imbuing it with global information, and also makes the network invariant to the order of the correspondences. Our experiments on multiple challenging datasets demonstrate that our method is able to drastically improve the state of the art with little training data.

* CVPR 2018 (Oral)

Via

Access Paper or Ask Questions